CQ 

O 
00 
CO 

CM 
CO 

in 
o 

CL 

LU 



(19) 



3 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



iiiiiiiiiiiiiig 

(H) EP 0 532 380 B1 

EUROPEAN PATENT SPECIFICATION 



(45) Date of publication and mention 
of the grant of the patent: 
07.01.1999 Bulletin 1999/01 

(21) Application number: 92402364.1 

(22) Date of filing: 28.08.1992 



(51) mt ciA- C12N 15/10, C12N 15/81, 
C12N 15/11, C12N 15/54 
//C12N 15/82, C12N 15/85 



(54) Method for synthesizing stable, single-stranded cDNA in eukaryotes by transfection with a 
vector containing a bacterial retron and the vector used therefor 

Verfahren zur Synthese von stafailer t einzelstrandiger cONS in Eukaryoten durch Transfektion mit 
einem bakteriellen "Retron" enthaltenden Vektor und der verwendete Vektor 

Methode pour synth^tiser un cDNA simple brin stable dans les eucaryotes par transfection avec un 
vecteur contenant un "retron" bacterien et le vecteur utilisable dans la methode 



(84) Designated Contracting States: 
DE FR GB 

(30) Priority: 30.08.1991 US 753110 

(43) Date of publication of application: 
17.03.1993 Bulletin 1993/11 

(73) Proprietor: UNIVERSITY OF MEDICINE AND 
DENTISTRY OF NEW JERSEY 

Newark, New Jersey 07107-3007 (US) 

(72) Inventors: 

• Miyata, Shohei 
Misato.shi, Saitama (JP) 

• Ohshima, Atsushi 
Uji-shi Kyoto-fu (JP) 

• Inouye, Sumiko 
Bridgewater, NJ 08807 (US) 

• inouye, Masayori 
Bridgewater, NJ 08807 (US) 

(74) Representative: Gillard, Marie-Louise et al 
Cabinet Beau de Lomenie 

158, rue de I'Universite 
75340 Paris Cedex 07 (FR) 



(56) References cited: 
US-A- 5 079 151 

• ANNUAL REVIEW OF MICROBIOLOGY vol. 45 , 
1 991 , PALO ALTO, USA pages 1 63 - 1 86 INOUYE, 
M. & INOUYE, S. msDNA and bacterial reverse 
transcriptase' 

• TIBS TRENDS IN BIOCHEMICAL SCIENCES vol. 
16 , January 1991 , CAMBRIDGE EN pages 18 - 
21 INOUYE, M. & INOUYE, S. 'Retroelements in 
bacteria' 

• SCIENCE (WASHINGTON, D. C), 243, 1033-8 24 
February 1989 pages 1033 - 1038 LAMPSON, 
BERT C. ET AL 'Reverse transcriptase in a 
clinical strain of Escherichia coli: production of 
branched RNA-linked msDNA' 

• PROC. NATL. ACAD. SCI. U. S. A., 89(1 3), 5735-9 ; 
1 July 1992 MIYAT, SHOHEI ET AL In vivo 
production of a stable single-stranded cDNA in 
Saccharomyces cerevisiae by means of a 
bacterial retron' 



Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give 
notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in 
a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art. 

99(1) European Patent Convention). Applicants: Michael Wayne Graham and 

Robert Norman Rice 

Printed by Jouvb, 75001 PARIS (FR) g^, Nq . j 0,759^4 , 

Filed: January 15, 2004 



Exhibit 4 



BP 0 532 380 B1 



Description 

FIELD OF. THE INVENTION 

The invention concerns the field of recombinant DNA. More particularly, the invention relates to an in vivo method 
of synthesis of stable single-stranded cDNA in eucaryotic cells by means of a bacterial retroeiement called a retron. 
The invention also relates to new eucaryotic vectors carrying the necessary elements to produce the single-stranded 
DNA-RNA hybrid structures. Moreover, the invention relates to transfected eucaryotes. e. g. , yeast, plant cells and 
mammalian cells. Uses are described for the new products. 

BACKGROUND 

Gram-negative bacteria such as Myxococcus xanthus , Stigmatella aurantiaca and Escherichia coii have been 
found to contain a retroeiement called a retron . In TIBS , 16, 18-21 (1991a), the authors report on a peculiar type of 
satellite DNA, named multicopy single-stranded DNA (msDNA). These molecules are characterized by a structure 
which comprises a single-stranded DNA branching out of an internal guanosine residue of a single stranded RNA 
molecule by a unique 2',5"-phosphodiester linkage. These molecules are thus single-stranded DNA-RNA hybrids. Re- 
verse transcriptase is required for the synthesis of these msDNAs. In Ann. Rev. Microbiol. , 45, 163-186 (1991b), the 
authors present a comprehensive review on msDNAs. Also see msDNA in Bacteria, Lampson et aL , Progress in 
Nucl eic Acid Research and Molecular Bio logy, 60, 1-24. 

The production of single-stranded cDNA by reverse transcriptase as a template is an obligatory step for RT-me- 
diated transcription of retroelements. See Retroelements . See Weiner et al., Ann. Rev. Biochem. , 55, 631-661 (1986) 
for review. This includes integration of retroviruses into mammalian genomes, production of infectious retroviruses 
from pro-viruses integrated into genomes, retrotransposition of retroelements, and formation of pseudo genes in eu- 
caryotic cells. 

However, single-stranded cDNAs produced in viv o by RT have never been directly detected, probably because of 
their instability. 

While the production of msDNAs in bacteria has been a most significant development, the in vivo production of 
single-stranded DNAs in eucaryotic cells, e^., yeast or higher eucaryotic cells like plant and mammalian cells, is of 
even greater interest. Eucaryotes have well-known advantagesoverprocaryot.es for producing target molecules. There 
is an important need to produce stable single-stranded DNA in a sufficient yield for numerous practical uses in research 
and in industry. This invention has made an important contribution in that respect in producing single-stranded RNA- 
DNA structures which are detectible, stable and useful. 

SUMMARY OF T HE INVENTION 

in accordance with the invention, a fundamental finding has been made. It has been discovered that single-stranded 
DNAs which are stable can be produced in vivo in eucaryotic cells. 

The invention relates to a method of producing a stable msDNA in a eukaryotic cell ; a DNA expression vector ; 
and a eukaryotic host cell transformed with said vector as defined in the claims and illustrated below. 

Briefly described, the invention provides a method (or process) for producing in vivo stable, single-stranded DNAs 
in eucaryotic cells like yeasts or plant cells or mammalian cells. The method of the invention produces a single-stranded 
cDNA by means of a retroeiement called a retron. The single-stranded DNA is produced as an integral part of a branched 
RNA-linked multicopy single-stranded DNA (msDNA) structure. These structures are stable, Le., detectible after pro- 
duction and isolation in spite of the fact that they are constituted of RNA and DNA, both singie-stranded. By the method 
of the invention also provides such msDNAs which contain foreign DNA and RNA fragments in the DNA and RNA 
portions, respectively, of the RNA-DNA structure can be produced. Though different from the known bacterial msDNAs, 
these molecules are designated msDNAs or "modified" msDNAs, because they have the characteristics and unique 
features of msDNAs as described herein. 

The method of the invention also employs retrons. Retrons are genetic elements which contain the coding region 
msr for the msRNA and msd for the msd DNA of the msDNA molecule, respectively, and the gene for reverse tran- 
scriptase (RT). The retrons have sequences which are different from known bacterial retrons in that the non-coding 
region has been shortened, specifically the region between the transcriptional initiation site of the selected promoter 
and the initiation codon of the RT gene. 

The invention also provides DNA expression vectors containing retrons which differ from known bacterial retrons 
in that, the RT gene is positioned upstream of the msr-msd region, in reverse relationship of that in bacterial retrons. 
These new retrons produce greater vields of msDNAs. 

The method of the invention produces msDNAs which contain a foreign DNA fragment in their DNA portion, for 
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instance, a single-stranded fragment complementary to the mRNA of a particular target gene (antisense DNA) and 
thus, may be valuable tools to inhibit or change the expression of undesirable proteins. Similarly, the msDNA can also 
contain a foreign RNA fragment. 

Further novel embodiments of the invention are eucaryotic hosts transformed with the DNA expression vector 
discussed above. 

Various uses for the single-stranded RNA-DNA structures are described, but do not form part of the invention. 
DEPOSIT OF GENETIC MATERIAL 

Plasmid YEp521-M1 has been deposited with the American Type Culture Collection (ATCC) under Accession No. 
74092. 

Plasmid YEp521-M4 has been deposited with the ATCC under Accession No. 74093. 
Piasinid YEp521-M5 has been deposited with the ATCC under Accession No. '14094. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 illustrates the biosynthetic pathway of msDNA synthesis. 

FIG. 2A shows the structure of the typical bacterial msDNAs. FIG. 2B shows the structure of rnsDNA-Ye117. 

FIG. 3 shows the arrangement of genes in the retroelement responsible for the production of msDNAs. 

FIG. 4 shows a comparison of the domain structures of various bacterial RTs. 

FIG. 5 shows piasmids YEp51 and YEp52. 

FIG. 6 shows the restriction map of the 11.6-kb EcoR1 fragment. 

FIG. 7 shows a diagrammatic representation of plasmid PC1-1BPv4, YEp521-M1, YEp521-M2, YEp521-M3 
YEp521-M4 and YEP521 M5. 

FIG. 8A shows bands a and b of a sequence polyacrylamide gel for the production of msDNA-Ec67 and FIG. 8B 
shows a schematic representation of extension of the 3' end of msDNA by AMV-RT and RNase A treatment. 

FIG. 9 shows Southern blot hybridization of msDNA-Ec67 produced in S. cerevisiae . 

FIG. 10 shows a diagrammatic representation of plasmid YEp521-M5. Darkened region in the retron represents 
50-bp antisense DNA for cdc28 (cloned into the Xhol site) inserted into the msd region of retron Ec67. Also shown is 
the 50-bp antisense DNA for cdc28. 

DETAILED DES CRIPTION OF THE FIGURES 

FIG. 1 Biosynthetic pathway of msDNA synthesis. The retron region consisting of the msr -msd region and the 
gene for reverse transcriptase (RT) is shown on the top of the Figure. Solid arrows indicate the locations of two sets 
of inverted repeats (a1 and a2, and b1 and b2). Open arrows indicate the genes for msdRNA (msr ), msDNA ( msd ), 
and RT. The primary transcript is considered to encompass the upstream region of msr through the RT gene, which is 
shown by a thin line at step 1 . The thick region in the RNA transcript corresponds to the final msd RNA. The branched 
G residue is circled, and the initiation codon for RT is also shown. On the folded RNA, a triangle indicates the 5' end 
processing site at the mismatching base. The dotted lines at steps 3 and 4 represent DNA strands. 

FIG. 2 (A) Structures of hybrid DNA-RNA msDNAare shown as follows: Mx162, Mx65, Sa163, Ec107, Ec67, Ec86 
and Ec73. ( Ann. Rev. Microbiol. , 45, 163-186 (1991)) (B) Structure of hybrid DNA-RNA msDNA-Ye117 is shown. The 
hatched zone represents the anti-sense DNA of cdc28. 

FIG. 3 Arrangement of genes in the retron element responsible for the production of msDNA. A single-copy retro- 
element on the bacterial chromosome contains the region required for the production of msDNA. All known msDNA 
coding regions contain three genes organized in a similar manner, as shown in (A): A gene, msd , codes for the DNA 
strand of msDNA. A second gene ( msr ) is situated, 5" to 3', in the opposite direction and codes for the RNA strand of 
msDNA. A closely positioned ORF codes for the RT. Transcription of this region initiates at or near the 5' end of msr 
and extends beyond msd to include the ORF. A set of inverted repeat sequences, a1 and a2, is also conserved among 
msDNA coding regions (short arrows). The circled G corresponds to the residue in the RNA that will contain the 2\ 5' 
branch linkage in msDNA (see also FIG. 8A). (B) For the E. coli retron Ec67, the region encoding msDNA is only a 
small part of a large element found on the chromosome (open bar). The junction of the Ec67 retron with the host 
chromosome is flanked by 26-base directly repeated chromosomal sequences, as shown by arrows. The Figure is not 
drawn to scale. 

FIG. 4 Domain structures of various bacterial RTs. The regions with closed bars and with stippled bars represent 
the RT and RNase H domains, respectively. 

FIG. 5 Yeast expression vectors YEp51 (7.3-kb) and YEp52 (6.6-kb). The structures of two yeast expression vectors 
are diagrammed. Both are composed of sequences from the yeast plasmid 2-u,m circle (smooth single line) spanning 
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REP3 and the origin of replication (_ ), from the bacterial plasmid pBR322 (jagged singie line) spanning the 
CoJE1 origin of replication and the gene conferring ampicillin resistance, from the yeast genome spanning the gene 
LEU2 ( c2 ), and from the region 5' to the yeast GAL 10 gene ( cr ), extending from the Sau3A site at -495 from the 
transcription-initiation site to the Sail site present in plasmid pNN78-A4 at +13. A cloned gene inserted in YEp51 in 
the Sai l. Sali-to-BarnHl, Sall-to-Hindlll, or Sall-to-Bcll sites pointed labelled I in the Figure, terminating at a site in the 
2-um-circle sequences indicated by the blocked arrow (T). Similar transcription would be obtained with genes inserted 
in the Hindlll or Hindi 1 1 to Bell sites of YEp52. Restriction enzymes: R, EcoRI; H, Hindi!!; B, BamHI; S, Sail, P, Pst l; 
Be, Bel l. See Broach et aL, Experi mental Manipulation of Gen e Expression, Academic Press Inc., New York, 1983. 

FIG. 6 Restriction map of the 11.6-kb EcoRI fragment. In the CI-1E map, the left-hand half (EcoRI to Hindlll) was 
not mapped. In the CM EPS map, the locations and the orientations of msDNA and msdRNA are indicated by a small 
arrow and an open arrow, respectively. A large solid arrow represents an ORF and its orientation. See Lampsonet aL, 
Science , 243, 1033-1038 (1989). 

FIG, 7 Diagrammatic representation of plasmid PC1-1BPv4, YEp521 -M 1 ,-M2,-M3,-M4 and M-5. Diagrams show 
only the regions (shaded bars) inserted in the yeast vector, YEp521. These regions contain retron-Ec67 and restriction 
sites shown are only those which are used for the construction of plasmids. Short arrows with msr or msd are the 
locations and the orientations of msdRNA and msDNA. Long arrows with RT represent the gene for RT and its orien- 
tation, Thick arrows represent the GAL 10 promoter and its orientation of transcription. Letters on top of bars are the 
sites of restriction enzymes: H, Hindlll; Ba, Bail; Pv, Pvu H; B, Bam HI; and S, Sma l. 

FIG. 8 A sequence polyacrylamide gel of the production of msDNA-Ec67 in S. cer evisiae . 

(A) Total RNA prepared from 0. 9mi of a late-log culture was used for detecting msDNA with AMV-RT as described 
herein below. The RT reaction mixture was subjected to electrophoresis on a 6% sequence-urea-gel. An aliquot 
of the reaction mixture was treated with RNase A prior to gel electrophoresis. Lanes 1 and 2 (G and C lanes, 
respectively) are DNA sequence ladders of pUC19 sequenced by chain termination method (Sanger et aL, Proc. 
Natl. Acad. ScL USA , 74, 5463-5467 (1977)) for size marks; lane 3 r the AMV-RT products with total RNA from 
yeast cells harboring YEp521-M1; Jane 4, the same sample as lane 3 except that it was treated with RNase A prior 
to gel electrophoresis; lane 5, the AMV-RT products with total RNA from yeast cell harboring YEp521 . The sample 
was treated with RNase A. Lane 6 is an Msp l digest of pBR322 labeled with [r 32 P]dCTP with the Klenow fragment 
of DNA polymerase 1. Numbers at the right-hand side indicate fragment sizes in base pairs and arrows with letters 
indicate positions of msDNA. 

(B) Schematic representation of extension of the 3' end of msDNA-Ec67 by AMV-RT and RNase A treatment. 
FIG. 9 Southern blot hybridization of msDNA-Ec67 produced in S. cerevisiae. 

(A) Total RNA fractions prepared from a 2. 5ml culture of yeast cells harboring YEp521-M1 (lane 1 ), and YEp521~M2 
(lane 2) and from E. coli CL83 harboring pCL-1 EP5c (lane 3) were used. After blotted to the nylon membrane filter, 
msDNA Ec67 was detected with the nick-translated 140-bp msr - msd DNA fragment as a probe. An arrowhead 
indicates the position of msDNA-Ec67. 

(B) Production of msDNA-Ec67 in S. cerevisiae harboring YEp521-M1, -M3, and-M4, Total RNA fractions prepared 
from a 2. 5ml culture of yeast cells harboring YEp521-M1 (lane 3),~M3 (lane 2), and-M4 (lane 1) were used for 
Southern blot hybridization as described in (A). An arrowhead indicates the position of msDNA- Ec67. 

FIG. 10 YEp521-M5 was constructed from YEp521~M4 by inserting into the msd region an Xho l site and into that 
site, cloning a 50-bp extraneous (foreign) dsDNA fragment which is complementary to mRNA of cdc28: step 1. The 
Xhol site was added into the msd region of YEp521-M4 by PCR. This construct was then digested by Xhol; then the 
antisense DNA was ligated into the msd region of retron Ec67: step 2. This plasmid was transformed into yeast (SP~ 
1) and the subsequently expressed msDNA designated herein as msDNA-Ye117. This is a novel structure. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS OF THE INVENTION 

In accordance with this invention, it has been discovered that a transfected yeast, Saccharomyces cerevisiae , 
produces a genetic structure, described as a synthesized, branched RNA-linked multicopy single-stranded DNA (msD- 
NA). One such msDNA produced was msDNA-Ec67. msDNA-Ec67 was synthesized from retron-Ec67. 

The production of other representative msDNAs is described. 

Several msDNAs have been described in the literature. Some of these are the following: Mx162 (Dhundaleet ai., 
Cell , 5J., 1105-1112, 1987); Mx65 (Dhundale et aL, J, Biol. Chem. , 263 , 9055-9058, 1988b); Sa163 (Furuichi et alj 
Cell, 48, 47-52, 1987a and Furuichi et aL, Cell, 48, 55 62, 1987b); Ec67 (Lampson et aL, Science , 243 , 1033-1038, 
1989b); Ec86 (Lim and Maas, Cell, 56, 891-904, 1989); Ec73 (Sun et aL, J. Bacterio l 173, 4171-4181, 1991); Ec107 
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(Herzer et al., Mol. Microbioi. , submitted, August 1 991 ); msDNA from E. coliB (Lim and Maas, Cell, 56, 891-904, 1 989). 

The msDNAs are often referred to in the literature by a numeral preceded by a suffix indicating a host origin. For 
instance, "Mx" referring to Myxococcus xanthus , and "Ec", referring to E. coli and "Sa" to Stigmatelia aurantiaca. 

msDNAs are unique molecules which in spite of extensive diversity, share similar structural features. Genericaily, 
s msDNA may be described as being a molecule which comprises a branched RNA which is covaiently linked to a single- 
stranded DNA by a 2',5'-phosphodiester bond between the 2'-OH group of an internal rG residue and the 5'-phosphate 
of the DNA molecule, and which RNA is non-covalently linked to the DNA by base pairing between the complementary 
3' ends of the RNA and DNA molecules, which RNA and DNA form the stable stem-loop secondary structures. The 
msDNA molecule is encoded by a primary RNA transcript, pre-msDNA, which contains an open-reading frame (ORF) 
w downstream of the msr locus encoding a polypeptide which has sequence similarity with retroviral RTs and a highly 
conserved sequence common to RTs. 

The pre-msDNA may alternatively contain its ORF upstream of the msr locus, in which event the retron will be of 
like construction. 

In FIG. 2 which shows typical msDNAs, the RNA portion of the molecule is shown "boxed"; the balance of the 
15 structure being the ssDNA portion. 

it will be noted that the molecules all show a branched rG residue, a DNA-RNA hybrid at the 3' ends of the msDNA 
and msdRNA, and a stem-loop structure in the RNA and DNA strands. The branching ribonucleotide, G, is circled and 
the 2',5'-phosphodiester linkage to the first deoxynucleotide is indicated. 

Retrons: a retron is a small genetic element to date found to be of 1.3 to 2.5-kb in length constituted of an msr- 
20 rnsd region and by the gene for encoding reverse transcriptase (RT). The coding region for msDNA is indicated by 
" msd "; the coding region for msdRNA is indicated by " msr ". 

A comparison of all known msDNA coding regions reveal that this locus contains three genes organized in a similar 
manner (see FIG. 3). A gene called msd codes for the DNA portion of msDNA. A second gene, msr , is situated 5' to 
3', in the opposite orientation of msd, and codes for the RNA chain. Thus the genes msd and msr are convergently 
26 oriented so that their respective 3' ends overlap by several bases. 

This overlap is equivalent to the H-bonded DNA-RNA structure formed by the overlapping 3' ends of the RNA and 
DNA strands in the msDNA molecule. For Mx162, the overlapping msd -msr genes, like the hybrid structure of the 
msDNA they produce, comprise 8 base pairs. See Table I for typical overlap lengths of various msDNAs. 

Determination of the nucleotide sequence in the vicinity of the msd-msr genes revealed a closely-linked open 
30 reading frame (ORF). This ORF is located immediately upstream from msd , but is transcribed in the same direction 
as ™sr ( as shown in FIG. 7). The initiation codon of the ORF is situated as close as 19 basepairs from the start of the 
msd gene for the Ec86 retron of E. coli B, but as much as 77 base-pairs for the Mx162 retron of M. xanthus . 

Another conserved feature of the chromosomal locus that codes for msDNA is a set of inverted repeat sequences, 
designated a1 and a2. Sequence a1 is located just upstream from the start of the msd gene, while sequence a2 is 
35 positioned immediately 5' to the G residue in the msr gene that forms the 2", 5' branch linkage in the msDNA molecule 
(FIG. 4). The inverted repeats display a large degree of nucleotide sequence diversity among the different known loci 
encoding msDNA, as well as differences in size. For example, the inverted repeats (a1 and a2) found in the retron 
locus encoding Mx162 are 34 nucleotides long, while the inverted repeats for the Ec86 retron of E. coli B are only 12 
bases in size. Despite their diversity, these repeat sequences are located in the same positions (as shown in FIG. 3) 
40 for all known loci encoding msDNA. As discussed in more detail below, the position of these inverted repeat sequences 
is critical to the synthesis of msDNA. 

It will be helpful to refer to the above discussion when the aspects are discussed which provide for an inversion 
in the organization (position inversion) of the RTgene with respect to the msr-msd coding region, and in the discussion 
of shortening the non-coding region between the transcriptional initiation site and the initiation codon AUG of the RT 
45 gene. 

The promoter for the msr - msd region is upstream of msr . Transcription is from left to right, encompassing the entire 
region including the RT gene. As described in further detail hereinafter, the replicating vehicle for transfecting the 
eucaryote host may harbor one promoter for the msr - msd and the RT, or it may contain two promoters, one for the 
msr - msd region and the other for the RT. 
so in accordance with the invention retrons may be constructed to yield an msDNA which differs from the typical 

msDNAs by features other than the common, conserved and characteristic features of msDNAs described above. For 
instance, it is not excluded that the length and/or location of the set of IRs a1 and a2 in the retrons be varied providing 
they remain in the same location discussed above. Thus, the size and/or location of the loops in the stems of the 
msDNA can be varied. 

55 Further, it is not excluded that the extent of the the overlap of the base pairing of the 3' ends of DNA and RNA in 

the msDNAs be influenced (increased or decreased) by appropriate manipulations. Whether such variations will be 
desirable will depend on the ultimate utility proposed for these msDNAs. 

General Features of msDNAs. Table i is a summary of the structure of representative retrons. 
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Reverse Transcriptase (RT).The domain structures of bacterial RTs of representative retrons are shown in FIG. 4. 

The RT gene is normally located downstream from the msr-msd region, in the new retrons which differ from the 
bacterial retrons, their relative positions are reversed, the msr-msd region is located downstream of the RT gene, 

The biosynthesis of msONAs has been described by Inouye & Inouye, Ann. Rev. Microbiol. , 45, 163-186 (1991b) 
and Herzer et aj., Mol. Microbiol. , submitted, August 1991. A schematic of the synthesis is shown in FIG. 1 . A primary 
transcript (pre-msdRNA) is considered to encompass the upstream region of msr through the RT gene, which is by 
reference to FIG. 1, shown by a thin line at step 1. The thick region in the RNA transcript corresponds to the final 
msdRNA. The branched G residue is circled, and the initiation codon for RT is also shown. On the folded RNA, a 
triangle indicates the 5' end processing site at the mismatching base. The dotted lines at steps 3 and 4 represent DNA 
strands. 

In summary, the primary transcript from the msr - msd region is believed to serve not only as a template but also 
as a primer to produce the msDNA. Synthesis of msDNA is primed from an internal rG residue of the RNA transcript 
using its 2'-OH group. Thus, msDNA is branched out from this rG residue by a 2-5*-phosphodiester linkage. 

There will be described hereinafter the transformation of yeast cells harboring plasmids which contain a retron 
(which includes the RT gene) for expression of the desired msDNAs. The description is of a best mode to date to 
express msDNA- Ec67 from its retron, Ec67. 

1. Synthesis of msDNA-Ec67. For the expression of msDNA-Ec67 t plasmid YEp52 was used. Plasmid YEp521 
was constructed by introducing the multiple cloning sites of pUC19 (Yanisch-Perron et aL, Gene, 33, 103-119, 1985) 
into YEp52, which was designed to obtain high-level, inducible expression of a cloned gene under the GAL 10 promoter 
in yeast. YEp52 contains the ColE1 origin of replication (OR), a promoter of theGALIO gene, LEJJ2, the 2p-circie origin 
of replication, and the 2-u. circle REP3 locus (SEE FIG. 5; See Broach et aL, Experiment al Manipulation of Gene 
Expression, Academic Press Inc., New York, 1983). 

Retron-Ec67 was prepared from plasmid pCL-1BPv4 in which the 4-kb Ball - Pvull fragment (DNA from fragment 
from the Ball to 2nd Pvull site from the left end of the map depicted in FIG. 5 was cloned into the Hind i site of pUC9. 
E. coii harboring this plasmid produces msDNA-Ec67 (Lampson et aL, Science , 243, 1033-1038, 1989). 

A total RNA fraction was prepared from cells transfected with pC1-1EP5c; pC1-1EP5c contains the 5-kb Pstl(a)- 
§c°. R! fragment encompassing the entire 4-kb Ball-Pvuli sequence in PC1-1BPv4. See FIG. 7. 

The construction of plasmid YEp521 proceeded as follows. The DNA fragment containing the pUC19 multiple 
cloning sites were isolated by digestion of pUC19 with EcoRI, the cleaved ends were filled in with the Klenow fragment 
of DNA polymerase 1, and then digested with Hind III. The resulting 54-bp fragment was cloned into YEp52 by replacing 
a fragment between the Bell (filled in with the Klenow fragment) and Hindi!! sites, resulting in YEp521. 

YEp521, thus constructed, contains the multiple cloning sites from pUC19, except for EcoRI, downstream of the 
GAL 10 promoter. 

The 4-kb Hindjll- Bam HI fragment from pC1-1BPv4 (see FIG. 7) was cloned into the Hindill and BamHI sites of 
YEp521. "~ " 

As a result, the msr -msd region and the RT gene of retron-Ec67 were placed downstream of the GAL10 promoter. 
This plasmid is designated YEp521-M1. 

Plasmid YEp521-M1 is illustrated in FIG. 7. The shade bars are the regions inserted in yeast vector, YEp521. 

It will be noted that the RT of retron-Ec67 gene is located behind (downstream) the msr - msd region. 

2. Production of msDNA in transformed yeast. A yeast strain (SP1: a ura3.leu2,trp1,his3,ade8,can r ,gat2) was 
used. Transformation of the yeast cells was carried out by the lithium acetate method of (Itoet aL, J. Bacteriol , 153, 
163-168, 1983). Yeast culturing was carried out as described below. msDNA was produced and was detected by 
extending the 3' end of msDNA by avian myeloblastosis virus reverse transcriptase (AMV-RT). This yielded a main 
product of 117 nucleotides. Treatment of this product with ribonuclease A resulted in a DNA of 105 nucleotides. These 
results are in good agreement with the structure of msDNA-Ec67 (see Lampson et aL, Science , 243 , 1 033-1 038, 1 989). 
The production of msDNA-Ec67 was further conformed by Southern blot hybridization. 

To determine whether the production of msDNA-Ec67 could occur in yeast without the RT genes from retron Ec67, 
the following work was performed. 

An RNA preparation from cells harboring YEp521-M2 only containing the msr - msd region under the GAL 10 pro- 
moter, and described herein above (see FIG. 7) was analyzed by Southern blot hybridization. As shown in lane 2, FIG. 
9, no band corresponding to msDNA-Ec67 was detected, indicating that the RT gene from retron-Ec67 is essential for 
the msDNA synthesis in yeast cells. 

In a similar manner, CHO cells can be transformed using known strategies and techniques. The same could be 
done with HeLa cells or other vertebrate mammalian cells. 

Gene Rearrangement in Retrons. An important finding in connection with the invention is that the yield of msDNA 
in transfected yeast cells is significantly improved by means which cause, it is believed, an increase in production of 
RT. One such strategy is to reduce by as many as possible the numbers of the AUG codons between the transcriptional 
initiation site of the GAL 10 promoter (or any other promoter used for that purpose) and the initiation codon of the RT 
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gene. Best results were obtained when a portion of the 5' end non-coding region containing initiation translation codons 
AUG is deleted, but for the first AUG codon in closest proximity to the 5' end. 

Thus, it was found that a significant portion of the 5' end of the non-coding region was not essential to production 
of msDNA in yeast ceils. Deletion of a portion of the nucleotide sequence containing the AUG codons significantly 
5 improved the yield of msDNA production. 

Specifically in Y£p521-M1 (see FIG. 7), there are 417 -bp from the 5 1 end Hindi II site to the initiation codon GAA 
for RT. (See FIG. 7 in Lampson et aL, Science , 243 , 1033-1038, 1989). A deletion of the 240-bp sequence upstream 
of the msr gene from the left hand most Hindlll site to immediately upstream of msr of YEp521-M1 (see FIG. 7) was 
carried out 

10 For this purpose, the fragment of 1 40-bp msr - msd (including 5 extra bases upstream of msd and 18 extra bases 

at the 3' end of the msr-msd region upstream of msd) (after PCR amplification) and the 1.8-kb RT gene (including 8 
bases upstream) of the initiation codon of the RT gene (also after PCR amplification) (and 4 bases downstream of the 
termination codon), were cloned into the Hindlll and Bam Hi sites of YEp521~M1 yielding YEp521-M3 (see FIG. 7). 
The yield of msDNA-Ec67 in transfected yeast with YEp521-M3 was shown to be significantly increased, as discussed 

is below. 

Another important finding made to substantially increase the yield of msDNAs in yeast is to transpose the position 
of the RT gene with respect to the msr - msd region. In bacterial retrons, the msr-msd region is in front of the RT gene; 
when the RT was moved upstream of the msr - msd region, a further increase in yield of msDNA was observed. This 
was accomplished as follows. 

20 Since the msr-msd region of YEp521-M3 still contains 3 AUG codons, YEp521-M4 (see FIG. 7) was constructed, 

in which the order of the RT gene and the msr-msd region was reversed, Le., the m^- rnsd region being positioned 
after the RT gene. In YEp521-M4, there is only one AUG codon between the left hand-most Hindlll site and the Bam HI 
site (see FIG. 7), which exists in the multiple cloning sites of PUC19 (Yabisch-Perron et aL, Gene , 33, 103-119, 1985). 
This AUG codon is terminated by a termination codon, UAG after 5 codons. The initiation codon, GAA, for the RT gene 

25 was placed 6 codons after the termination codon in the same reading frame. 

YEp-M5 was then constructed from YEp521-M4 by adding 50-bp antisense DNA for cdc28 into the region coding 
for msd (described in further detail below). Therefore, two plasmids were constructed in which the order of the RT gene 
and the msr-msd region was reversed. 

The yield of msDNA-Ec67 in yeast cells transfected with YEp521-M4 was compared with that of yeast cells trans- 
it? fected with YEp521~M3. YEp521-M4 brought about a further increse (about 8 fold) of yield over YEp521~M3. 

It has been reported that a ribosomal subunit (carrying Met-trNA mel and various initiation factors) binds initially at 
the 5' end of mRNA and then scans through the mRNA stopping and then initiates translation at the first AUG codon 
in a favorable context (Kozak, J. Cell Biology, 108 , 229-241 , 1989). From a recent survey of 699 vertebrate mRNAs, 
GCCGCCACCAUG emerges as the consensus sequence for initiation of translation in higher eucaryotes (Boekeet 

36 §L> Cell, 40, 491-500, 1985). The survey reports the study of the 5" non-coding sequences of the 699 vertebrates 
mRNAs (all sequences to which access could be had in the literature). The mRNA source of the vertebrates included 
human (muscle, skeletal, liver, intestinal, etc.), bovine, rat and others. Also in yeasts, AUG was reported to be the 
consensus sequence for initiation of translation. (Hamilton et al, Nucl. Acids Res. , 15, 3581-3583, 1987) It is noted 
that Kozak, J. Cell Biology 108 , 229241, 1989 also reported variations and exceptions to the more general rule de- 

40 scribed above. For instance, there are reported cases where initiation is not restricted to the first AUG codon, which 
therefore is not used exclusively, but includes other AUG codons in the vicinity of the 5" end. Further, inactivating the 
first AUG codon closest to the 5" end, allowed ribosomes to initiate translation at another codon (UUG). 

As described herein above, the location of the initiation codon of the ORF for various msDNAs can vary (e.g. , 
19-bp from the start of the msd gene for Ec86 retron and 77-bp for the Mx162 retron). Thus, one skilled in the art can 

45 adjust the length of the excised non-coding region of the retron when the above strategy is followed. 

The finding in connection with the invention described above, namely, that a significant improvement in yield of 
msDNAs takes place when AUG codons between the transcriptional site of the GAL 10 promoter and the initiation 
codon of the RT gene are deleted, but for the one AUG codon closest to the 5' end which is preserved, is therefore 
consistent with the above-discussed literature reports. Accordingly, this finding made in accordance with the invention 

50 with respect to the production of msDNAs is not intended to be limited to yeast, but can reasonably be predicted to 
apply to other msDNA-producing transfected eucaryotes, in particular higher eucaryotes like mammalian ceils, e.g. , 
HeLa cells, CHO, COS-1 cells and others. 

The same observation can be made regarding the position of the RT gene upstream of the msr - msd region. This 
finding too is believed to have general applicability to the production of msDNAs in eucaryotes, as noted above. It is 

55 believed that these described strategies may contribute to an increase in RT and ultimately in yield of msDNAs. 

It will be apparent to one skilled in the art that the two strategies described (deletion of AUG codons and inversion 
of the respective positions of the RT gene and the msr - msd region, do not have to be performed together (as shown 
with respect to YEp521-M4), which is a best mode to date. For instance, the strategy may be performed without the 
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deletion strategy, and vice-versa. 

The m sDN As which are synthesized from these new retrons are also new. 

As has been noted herein, it is not necessary that one promoter for the RT gene and the msr-msd region be used. 
More than one can be used, one for the RT and one for the msr-msd region. When it is desired to use two promoters, 
either one or both of the strategies to increase RT production namely the inversion and/or deletion strategy can also 
be used, as will be apparent to one skilled in the art. 

It is noteworthy that the DNA sequences, which contain these unique retrons (due to the deletions and/or position 
inversion) and which encode the new msDNAs, are new when compared to known bacterial retrons. So are the repli- 
cating vehicles carrying these retrons and the transfected eucaryotes harboring these vehicles. They provide effective 
means to produce new single-stranded DNA in eucaryotes in improved yields. 

It is to be noted also that the two above-described strategies which have been discussed with respect to eucaryotes 
are applicable to msDNAs produced from modified retrons in procaryotes. 

The invention has been illustrated with an illustrative retron, Ec67. However, by a similar procedure, yeast can be 
made to produce other msDNAs. For instance, in a similar manner, retron Ec73 can be used to transform yeast strain 
SP1 to produce msDNA-Ec73. 

Likewise, a simitar procedure can be followed to transform and produce msDNA-Mx65 in yeast from the necessary 
retron elements. See Dhundale et al„ JBC , 263, 9055-9058, 1988. Its ORF codes for 427 ammo acid residues. 

If it is desired to produce msDNA~Mx162 in yeast, the appropriate DNA fragment containing retron Mx162 can be 
prepared from a 17.5~kb Sail fragment which is disclosed in Yee etaL, Cell, 38, 203-209, 1984. Its ORF codes for 485 
amino acid residues. 

For the expression of msDNA-Ec107, a similar strategy may be followed. The retron is a 1 .3-kb DNA fragment of 
which the 34-bp intergenic sequence between pyrE and ttk (in FIG. 4) is deleted. The retron contains an ORF coding 
for 319 amino acid residues (from base 396 to 1352 in FIG. 2). The reference to Figures made hereinabove is to 
Dhundale et aL, Cell, 5jL 1105, 1987. This retron is the smallest yet found in bacteria. 

The retron for Sa163 was determined to be contained in a 480-bp DNA fragment encompassing the msd and msr 
regions (Furuichi et aL). 

The retron for Ec73 was determined to be contained in a 3.5-kb sall(b)-EcoRI(c) fragment (see FIG. 1A of Sun 
et aL). For details on Ec73, see below. 

The retron for Ec86 was determined to be contained in a 3.5~kb Pstl fragment (Lim and Maas, Cell, 56, 891-904 
1989). ~~ 

Likewise, from retrons Sa163, Ec86 and Ec73, the corresponding msDNAs, msDNA-Sa163, msDNA-Ec86 and 
msDNA-Ec73 may be produced in transfected yeast cells. If plant or mammalian vertebrate cells are used, appropriate 
manipulations and strategies will be followed. 

Similar techniques may be followed to express other msDNAs known or yet to be found or to be synthesized from 
their respective retrons. All of these retrons are expected to contain the elements necessary to synthesize the unique 
features of msDNAs, as is described herein. 

Thus, in general retrons containing the essential features described herein are useful to produce in vivo in eucary- 
otes the stable (not degraded) msDNAs having the conserved and characteristic features described herein. 

msDNA-Ec73 is synthesized from retron Ec73 which is described by Sun et al., Journ al of Bacteriology 173 
4171-4181, 1991. ~~ ' 

FIG. 2 therein shows the nucleotide sequence capable of synthesizing msDNA-Ec73, a 3.5-kb S(b)-E(c) fragment. 
It was determined that the first ATG codon at position 11,544 is the initiation for the necessary RT gene and the ORF 
for the RT is of 316 residues. 

It is to be noted that in ail retrons known to date, the RT gene is located at 20 to 77-bp upstream of the msd gene 
(downstream of the msd gene). 

In all retrons studied to date, it is believed that the promoter elements serve as the promoters for both msd RNA 
synthesis and the ORF. For instance, for msDNA-Ec67, promoter elements in a -1 0 region TTGACA and in a -35 region 
TGAAT, are believed to fulfill this function (see Lampson et a_L, Science , 243, 1033-1038, 1989) . However, in accord- 
ance with the invention, it is not essential that there be one promoter element for both components, but rather two 
promoter elements, one for initiating RNA polymerase transcription for the RT gene and the other for the msr - msd 
region. Thus the msr- msd region and the RT gene can be expressed under two independent promoters, which would 
be likely to complement each other. However, it appears at this time that at least for two of the msDNAs described 
herein (msDNA-Ec67 and msDNA-Ec73), the production of msDNA-Ec67 can only be complemented by the RT-Ec67 
and not by the RT-Ec73 or vice-versa. 

Further, it is often desired to use a strong promoter rather than the native promoter. 

Another use relates to the in vivo production in eucaryotes of any DNA fragment(s), non-native or foreign, to the 
msDNA structure. Likewise, the vectors and the transfected eucaryotic hosts, carrying such foreign DNA fragments), 
the synthesis in vivo in eucaryotes of stable msDNAs which encompass a foreign DNA fragment in the DNA portion 
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or a foreign RNA fragment in the DNA portion of the DNA-RNA hybrid structure are envisaged. Of particular interest 
are rnsDNAs which include a single-strand or DNA or RNA fragment which is complementary to the rnRNA of a particular 
target gene (or fragment) thereof (antisense DNA or RNA). 

In one example of this embodiment, there was constructed a plasmid, YEp521-M5, into which there was inserted 
s in the msd region, nucleotides 299-426 of YEp521-M4 (the boxed region of the lower strand of FIG. 7 of Lampson et 
aL, Science, 243, 1033-1038, 1989), a Xhol restriction recognition site (TiCTAG) ; a foreign DNA fragment of 50-bp 
was inserted in this Xhol site, YEp521-M5 was transformed into yeast (SP-1 ) and the subsequently expressed msDNA 
designated herein as msDNA-Ye117 (see FIG. 2B). 

In alike manner, there may be inserted into the msr region of YEp521-M4 a restriction recognition site, or a DNA 
io fragment. This retron may be transformed into yeast (SP-1 ), and the subsequently expressed msDNA is a new structure. 
It corresponds to the structure designated here as msDNA-Ye117, except that the new foreign fragment is in the RNA 
portion of the msDNA. 

Thus, the construction of a system that may be used to regulate the production of genes can be envisaged. The 
modified rnsDNAs contain in the DNA portion, a clone DNA fragment from a gene downstream of a promoter in the 

15 orientation promoting the production of antisense DNA or RNA (micRNAs). The "micRNA" terminology has been applied 
to an RNA transcript which is an mRNA-interfering-comptementary RNA (Coleman etaL, Cell, 37, 429-436 (1984) and 
literature references cited therein). "micRNA" has been reported to inhibit the production of certain proteins (e.g. , 
OmpF). A similar regulation has been reported for a micRNA and the gene for the Tn10 transposase gene. The gene 
for the micRNA and for the transposase are reported to be transcribed in apposite directions of the same segment of 

20 DNA, such that the 5' ends of their transcripts can form a complementary hybrid. The hybrid is thought to inhibit the 
translation of the transposase rnRNA. Coleman et aL, supra., report the construction of an artificial "mic" system de- 
signed to regulate the expression of any specific gene in E. cpj. 

Various cell division cycle (cdc) genes are known; by now some 50 different cdc genes have been defined in terms 
of landmark events occurring during duplication of cellular molecules (e.g., glycolic events). Various cdc genes and 

25 their functions are described in Watson etaL, Molecular Biology of the Gene , Fourth Ed. (1987), Chapter 18. Amongst 
these are cdc4 required for initiation of DNA synthesis in the mitotic cell division cycle and other functions; cdc7 of 
similar function to cdc4 but for premeiotic DNA synthesis; cdc28 necessary for duplication of the spindle pole body is 
homologous to mammalian protein kinases and has protein kinase activity, and others like cdc8, cdc9 and others. 
The strategy to produce a msDNA containing a foreign dsDNA fragment in its DNA portion is depicted in FIG. 10. 

30 The DNA fragment is shown (dark bar). The 50 bp nucleotide fragment has the following sequence (SEQ ID No. : 1): 

S * TCGATGTAATTTCCTAATOCACC^ 3 ' 
35 ACATTAAACGATTAAGTGGCGACTACAAGCTTC 5 ' 



Yeast cells (SP-1) transfected with YEp521-M5 produced a new msDNA-like structure, msDNA-Ye117, shown in 
FIG. 2B (analyzed by polyacrylamide-urea gel electrophoresis). This new msDNA construct contains the 50-bp DNA 
fragment. It is contemplated that the new msDNA is a useful vector for antisense DNA. The new construct is expected 
to produce a single-stranded DNA which is complementary to a specific rnRNA, in this instance, that of cdc28 and 
inhibit the expression of that rnRNA, and of the gene. 

Antisense DNA (micDNA) and micRNAs which are complementary to regions of the rnRNA known to interact with 
ribosomes, would be of particular interest. Hence, such rnsDNAs that contain such DNA-micDNA generating regions 
are of special interest for various applications. Thus, by inserting an appropriate DNA fragment of a gene after a pro- 
moter, e^, into an Xhol site, one can construct with the rnsDNAs disclosed herein (and others) a system to specifically 
regulate the expression of any gene. 

This is the first time that such antisense system has been provided from a molecule produced in an eucaryote. 

It is contemplated that other DNA fragments be inserted in the msd region and/or the msr region of the plasmid 
here disclosed and the corresponding new rnsDNAs synthesized which may have similar functions, e.g. , to generate 
a micDNA or a micRNA complementary to a rnRNA to inhibit its gene. 

Likewise, YEp521-M1 can be modified, producing an enlarged new msDNA structure (on the 5" end of the DNA 
portion of msDNA). 

When it is desired to insert a DNA sequence encoding a protein (polypeptide) e.g. , two copies of a gene, the DNA 
sequence will be inserted in opposite orientation to another at a selected restriction site into the msd sequence of an 
msDNA of choice, such as YEp521-M4. There is expected to be produced in an eucaryotic host, a novel msDNA-RNA 
structure. When the lacZ gene is incorporated into a suitable location in the msd region of the selected construct, It is 
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expected that p-gaiactosidase activity will be detected. 
EXAMPLES 

The following Examples are offered by way of illustration and are not intended to limit the invention in any manner. 
In these Examples, all percentages are by weight for solids and by volume for liquids, and all temperatures are in 
degrees Celsius unless otherwise noted. 

For convenience and clarity, the Examples refer to and provide also a detailed description of the Figures. 

Example 1 

Yeast Strains, Media and Growth Condition 

Yeast SP1 strain (a ura3 Ieu2 trpl his3 ade8 can r ga!2) was used. Cells were grown in YPD medium (1% yeast 
extract, 2% bactopeptone, and 2% glucose). For screening transformants of YEp52 and its derivatives, a minimal 
medium was used (Rose et aL. Methods in Yeast Genetic s; A to Course Manual , Cold Spring Harbor Lab., 

Cold Spring Harbor, NY, 1990), supplemented with ali nutrients required but leucine. For galactose induction, 0. 15ml 
of the pre-cultured ceils in the minimal medium containing 2% galactose instead of glucose were utilized. The cells 
were grown at 30° until late-log phase. Yeast transformations were carried out by the lithium acetate method (6). 
Transformation of yeast cells was confirmed as follows: the piasmid prepared from yeast transformants was trans- 
formed into £. coli DH-5 (F~ endA1 recAl hsdR17 (rk% k + ) supE 44 thi-1 , gyrA 96, relA 1 ) and the piasmid prepared from 
DH-5 cells was not yet subsequently characterized. Piasmid DNA from yeast cells was prepared according to the 
method described by Hoffman and Winston, Gene , 57, 268-272, 1987. 

Plasmids : YEp52 (Broach et aL, Experimental Manipulation of Gene Expression, Academic Press Inc., New York, 
1983) was used to construct plasmids for expression of msDNA in yeast This piasmid contains the ColEI origin of 
replication, a promoter of the GAL 10 gene, LEU2 , the 2u-circle origin of replication, and the 2u.-circle REP3 locus. 
Retron-Ec67 was prepared from piasmid pC1 1BPv4 in which the 4-kb Ball-Fvull fragment (DNA fragment from the 
BA1I to the 2nd Pvull site from the left end of the map described in FIG. 5 of Lampson et aL, Science , 243, 1033-1038, 
1989), was cioned into the Hindi site of pUC9. E. coli harboring this piasmid produces msDNA-Ec67. A total RNA 
fraction was prepared from cells transformed with pC1-1EP5c. pCL-1EP5c contains the 5-kb Pstl(a)~EcoRI fragment 
encompassing the entire 4-kb Ba1-I-Pyull sequence in pC1-1BPv4 (see FIG. 5 of Lampson et aL, Science, 243, 
1033-1038, 1989) in pUC9. 

Piasmid Construction : piasmid YEp521 was constructed by introducing the multiple cloning sites of pUC19 (Ya- 
nisch-PerronetaL, Gene, 33, 103-119, 1985) into YEp52 (Broach etal, B^e rimentaj Manipulation of Gene Expression , 
Academic Press Inc., New York, 1983), which was designed to obtain high-level, inducible expression of a cloned gene 
under the GAL 10 promoter in yeast. The DNA fragment containing the pUC19 multiple cloning site was isolated by 
digestion of pUC19 with EcoRi; the cleaved ends were filled in with the Klenow fragment of DNA polymerase 1, and 
then digested with HindHL The resulting 54-bp fragment was cloned into YEp52 by replacing a fragment between the 
Bell (filled in with the Klenow fragment) and Hindlll sites, resulting in YEp521 . YEp521 , thus constructed, contains the 
multiple cloning sites from pUC19, except for EcoRi, downstream of the GAL 10 promoter. The 4-kb Hindlll- Bam HI 
fragment from pC1-1BPv4 was cloned into the Hindlll and Bam HI sites of YEp521 . As a result, the msr - msd region 
and the RT gene of retron-Ec67 were placed downstream of the GAL 10 promoter. This piasmid is designated 
YEp521-M1 as shown in FIG. 7. 

In order to eliminate a fragment of 242 bases upstream of msr which contains several ATG codons, polymerase 
chain reaction (PCR) was performed using YEp521-M1 as a template with two synthetic oligonucleotides, M2-a 
( S 'GCAAGCTTCATAAACACGCATGT 3 ': SEQ ID No.: 2) and M2-b ( 5 'CTGGATCCAGAAACGCATGCAGG 3 ': SEQ ID 
No. : 3) as primers. 

These sequences correspond to the sequences from base 243 to 258 of retron Ec67 for M2-a and from base 384 to 
369 for M2-b (see FIG. 7 of Watson et ai. < 22 >), which flank the msr-msd region. The 140-bp PCR product was gel- 
purified and digested with Hindlll and Bam HI. The resulting fragment was cloned into the Hind ill and Bam HI sites of 
YEp521, yielding YEp521-M2. YEp521-M2 contains only the msr- msd region under the GAL 10 promoter. 

To insert the RT gene at the Bam HI site of YEp521-M2, the 1 .8-kb Bam Hi fragment encompassing the RT gene 
was amplified by PCR using YEp521-M1 as a template and two oligonucleotides, M3-a 
( 5 'CTGGATCCAAGAAATGACAAAAACA 3 ': SEQ ID No.: 4) and M3-b( 5 'CTGGA TCCTT CATTA GCTAT TTAAC AT 3 ': 
SEQ ID No.: 5) as primers which correspond to base 409 to 429 and from base 2182 to 2163 of retron-Ec67 (see FIG. 
7 of Lampson etaL, Science , 243 , 1033-1038, 1989), respectively. The 1 .8-kb fragment was gel -purified, digested with 
Bam Hi, and closed into the BamHI site of YEp521-M2. The resulting piasmid was designated YEp521-M3. 

YEp521-M4 was constructed to change the order of the msr-msd region and the RT gene. The msr - msd region 
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was amplified by PCR using M2-a and M2-b (see above) except that S ma I sites were added at their 5' ends. The 1 ,8-kb 
BamHI fragment containing the RT gene was cloned into the BamHI site of YEp521. Subsequently, the 140-bp Srnal 
fragment containing the msr-msd region was cloned into the Smal site of the above plasmid and the resulting plasmid 
was designated YEp521-M4. 

YEp521-M5 was constructed from YEp521-M4 to add the 50-bp antisense DNA for cdc28 (Xhol fragment) into the 
rr>sd region. The Xhol site was added into the msd region of YEp521-M4 by PCR. This construct was then digested 
by Xho l; then the antisense DNA was ligated to the msd region of retron Ec67. This plasmid was transformed into 
yeast (SP-1) and the subsequently expressed msDNA designated herein as msDNA-Ye117. 

Detection of msDNA : a total RNA fraction from yeast cells was prepared as described by Elder et aL, Proc. Natl. 
Acad- Sci. USA , 80, 2432-2436, 1983 and a total RNA fraction from E. coj was prepared from E^ colt harboring 
pC1-1EP5c by the method described by Chomzynski et aL, Anal. Biochem., 162, 156-159, 1987. 

To label msDNA with reverse transcriptase, the total RNA fraction prepared from 0.9ml of a late-log culture was 
added to 20 ul of a reaction mixture containing 30mM Tris-HCl (pH 8.2), 50mM KCI, 10mM MgCI 2 , 5mM DTT, 0. 2mM 
each of dTTp, dGTP, dCTP, 5 \xd of [y- 32 P]dATP and 5 units of avian myeloblastosis virus reverse transcriptase (AMV- 
RT; Molecular Genetic Resources). The reaction mixture was incubated at30°C for 1 hour, and an aliquot of the reaction 
mixture was subjected to electrophoresis with a 6% polyacrylamide -8M urea gel. Another aliquot was treated with 
RNase A (10 ug/mi) for 10 minutes at 37°C and subjected to electrophoresis. 

msDNA-Ec67 was also detected by Southern blot analysis (Southern, Mol. Biol., 98, 503-517, 1975). Total RNA 
from 2.5ml of a late-log culture was applied to a 1.5% agarose gel with E buffer [40mM Tris HCI(pH 8.0), 10mM sodium 
acetate, 2mM EDTA). After electrophoresis, the gel was blotted to a nylon membrane filter (PALL BLODYNE A TRANS- 
FER MEMBRANE; ICN) by the capillary transfer method. Hybridization was carried out in 50% (v/v) formamide, 5 x 
SSPE [1 x SSPE; 180mM NaCl, 10mM sodium phosphate (pH 7. 4), 1mM EDTA], 0.3% sodium dodecyl sulfate, and 
5 x Denhardfs solution (Denhardt, Biochem. Biop hys.Res. Commun. , 23, 641-646 (1966)) with the nick-translated 
140-bp msr-msd region as a probe (Rig by et aL, J. MoL Biol., 113 , 237-251, 1977). 

As noted above, the method of the invention provides for the expression of the desired msDNAs from eucaryotes. 
While the invention has been illustrated with a yeast of the genus Saccharomyces , other eucaryotes are readily suitable 
to practice the invention. 

A convenient source of suitable yeasts is found in the ATCC C at alogue of Yeasts , 18th Ed., 1990. Because of 
practical and economic importance, the invention is particularly directed to the genus Saccharomyces which is exten- 
sively used in baking, beer, wine and other industries. Conventionally these yeasts are referred to as baker's, brewer's 
and wine yeasts. 

Amongst these, of special interest are the S. cerevisiae strains, the S. bayanus , S. carlsbergenensis , S. diataticus , 
and S. uvarum , which lend themselves to transformation with the vectors of the invention. Further, to express the 
msDNAs, one may use vertebrate host cells like COS-1, CHO and HeLa cells or invertebrate cells or plant cells. 

Plants that may be used include monocotyledons and dicotyledons. Illustrative examples of plants which may be 
transformed are the following: alfalfa, soybeans, maize and wheat (Genetic Engineering of Plants, An Agricultural 
Perspective , Edited by Kosuge etaJL, Plenum Press (1983)). 

Various cloning vectors may be used to transfect compatible eucaryotic host cells for replication of the vector. 
Thereafter the transformants are identified, plasmid DNA prepared therefrom, and the msDNAs extracted and purified. 

Vectors for expression of cloned genes in yeasts are described in Methods in Enzymology , Vol. 194, "Guide to 
Yeast Genetics and Molecular Biology", page 373 (Guthrie and Fink, Eds., Academic Press Inc. , 1991). It will be 
apparent from one skilled in the art to select an appropriate promoter for expressing msDNAs in yeast with or without 
a foreign DNA fragment, such as from regulatable promoters of the GAL family, ejj., GAL4, GAL80 , GAL1, GAL2, 
GAL7 , GAL10 , GAL 11, MEL1, ADH1 and PGK; also see Broach et at., Experimental Manipulation of Gene Expression , 
Academic Press, Inc., New York, 1983 or non-regulatable strong promoters. 

Oligonucleotide synthesis may be carried out by a number of methods including those disclosed in U.S. Pat. No. 
4, 415, 734, and in Matteuci et aL, J. Am. Chem. Soc , 103 (11):3185-3191 (1981), Adams etal., J. Chem. Soc , 105 
(3): 661-663 (1983), and Bemcage et aL, Tetrahedron Letters , 22 (20): (20):1 859-1 867 (1981). 

For the expression of msDNAs in higher eucaryotes with or without selected DNA fragment, one skilled in the art 
may refer to and use known techniques. The advantages of synthesizing particular eucaryotic proteins in eucaryotes 
are well known. Depending on the msDNA which is intended to be produced, an appropriate eucaryote host cell will 
be selected (see Molecular Cloning: A Laboratory Manual . Second Edition, §3, §16.3 and seg. (Sambrook et al., Cold 
Spring Harbor Laboratory Press, 1989)). The eucaryotic expression vehicle will contain, as is known, a promoter and 
enhancer elements, recognition sequences, the TATA box and upstream promoter elements. Other conventional ele- 
ments located upstream of the transcription initiation site for replication and selection are known and described in 
standard laboratory manuals. Vectors are available commercially, for instance from Pharmacia (pMSG, pSVT17, 
pMT2). For methods for introducing recombinant vectors into mammalian cells, see Molecular Cloning: A Laboratory 
Manual , Second Edition, §16. 30-16. 55, (Sambrook et al., Cold Spring Harbor Laboratory Press, 1989). For cosmid 
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vectors for transfection of mammalian cells, see Mole cular Cloning: A Laboratory Manual, Second Edition, §23.18 and 
seq. , (Sambrook et aL, Cold Spring Harbor Laboratory Press, 1 989). 

Further, one skilled in the art may wish to refer to Current Protocol s In Molecular Biology , Volume 1, §16. 12-16. 
13.7 (Ausubel et aL , Eds., , Greene Publishing Associates and Wiley-lnterscience, 1989), discussing in particular, 
three vector systems or strategies for introducing foreign genes into mammalian cells with COS cells, CHC and vaccinia 
virai vectors. One skilled in the art will select the most appropriate system for the production of msDNAs from the 
selected retrons, and further, for introduction of DNA into mammalian cells (see Current Protocols In Molecul ar Biology, 
Volume 1, §9. 01-9.93 {Ausubel et aj., Eds., Greene Publishing Associates and Wiley-Tnterscience, 1989)). 

The msDNAs have several interesting utilities. 

A fascinating utility that is being considered is the role that msDNAs can play on the formation of triple helix DNA, 
or triplex DNA with a specific duplex on the chromosome. A recent report in Science , 252, 1374-1375 (June 27, 1991), 
"Triplex DNA Finally Comes of Age", highlights the timeliness of the present invention. Triplex DNA can be formed by 
binding a third strand to specific recognized sites on chromosomal DNA. Synthetic strands of sizes preferably containing 
the full complement of bases (such as 11-15 and higher), are discussed. The msDNAs with long 3"(or 5') ends (and 
the loop of non-duplexed bases) would appear to be excellent candidates. These regions provide single-stranded DNA 
necessary for the triplex formation. The resulting triplex DNA is expected to have increased stability and usefulness. 
New therapies based on the triple helix formation, including in AIDS therapy and selective gene inhibition and others 
are proposed in the Report. 

Artificial, synthetic msDNAs can be designed and may be used as antisense DNAs, and/or RNAs and/or ribozymes 
using the single-stranded DNA or RNA region of msDNAs. Such msDNA (containing a foreign ssDNA or ssRNA frag- 
ment) for use as antisense system, been described above. The production of an msDNA with complementarity with a 
gene (or portion) thereof, blocks the synthesis of the specific protein itself. The msDNA system produced in eucaryotic 
cells to generate a desired complementary DNA of an mRNA of a gene, appears to have real potential in eucaryotic 
cells to block the expression of various harmful or toxic genes, such as drug resistance, oncogenes, and phages or 
viruses. The system could have applications to AIDS therapy. Of special interest, are the msDNAs that would be 
produced by HeLa cells and containing such selected DNA fragment for use in antisense applications. 

As described above, it is contemplated that genes be inserted for instance, in the the stem region(s) of the msDNAs. 
Thus the msDNAs may be used for amplification of the selected gene. 

The polymerase chain reaction (PCR) is a well-known rapid procedure for in vitro enzymatic amplification of a 
specific segment of DNA. The standard PCR method requires a segment of double-stranded DNA to be amplified, and 
always two-single stranded oligonucleotide primers flanking the segment, a DNA polymerase, appropriate deoxyribo- 
nucfeoside triphosphate (dNTPs), a buffer, and salts ( Current Protocols , Section 15). 

Thus, the msDNAs due to their unique structure (and stability), are expected to be of value in numerous applications 
in the biochemical, medical, pharmaceutical and other biological sciences. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: University of Medicine and Dentistry of New Jersey 

(ii) TITLE OF INVENTION: Method for synthesizing stable single-stranded cDNA in eukaryotes by means of a 
bacterial retron, products ans uses therefor. 
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(ill) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

5 (A) ADDRESSEE: Cabinet Beau de Lomenie 

(B) STREET: 55, Rue d'Arnsterdam 

(C) CITY: Paris 

(E) COUNTRY: FRANCE 

(F) ZIP: 75 008 

10 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

is (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1,25 

(vii) PRIOR APPLICATION DATA: 

20 (A) APPLICATION NUMBER: US 07/753,110 

(B) FILING DATE: 30-AUG-1991 

(viii) ATTORNEY/AGENT INFORMATION: 

25 (A) NAME: GlLLARD, Marie-Louise 

(C) REFERENCE/DOCKET NUMBER: MLG.MHC/J 16983-4 

(ix) TELECOMMUNICATION INFORMATION: 

30 (A) TELEPHONE: 42 80 64 68 

(B) TELEFAX: 48 74 37 60 

(C) TELEX: 650476F 

{2) INFORMATION FOR SEQ ID NO:1: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

40 (C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
*5 (iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 



50 

TCGATGTAAT TTGCTAATTC ACCGCTCATG TTCOAAGGAT AGTTCTATTT GATC 54 



(2) INFORMATION FOR SEQ ID NO:2: 

55 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
GCAACCTTCA TAAACACGCA TOT 23 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CTGGATCCAG A.AACGCATGC AGG 23 
(2) INFORMATION FOR SEQ ID NO:4: 

I 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

CTGGATCCAA GAAATGACAA AAACA 25 

(2) INFORMATION FOR SEQ ID NO:5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
CTGGATCCTT CATTAGCTAT TTAACAT 



Claims 

1 . A method for producing a stable msDNA in a eucaryotic host cell which comprises transfecting the cell with a DNA 
expression vector which replicates in the cell, which vector contains a retron encoding the msDNA, which retron 
contains msr and msd coding regions (msr - msd ) for the msDNA and a gene encoding a reverse transcriptase 
(RT), and expressing the msDNA from the retron. 





2. 


The method of claim 1 


wherein 


20 


3. 


The method of claim 2 


wherein 




4. 


The method of claim 3 


wherein 




5. 


The method of claim 1 


wherein 


25 


6. 


The method of claim 3 


wherein 




7. 


The method of claim 3 


wherein 


30 


8. 


The method of claim 7 


wherein 



9. The method of claim 3 wherein the expression vector contains a promoter for a retron consisting of msr-msd region 
and a RT gene, which promoter is located upstream of msr. 

10. The method of claim 9 wherein the promoter is a foreign strong promoter. 

11. The method of claim 10 wherein the strong promoter is GAL10. 

12. The method of claim 2 wherein in the retron in the plasmtd, the msd sequence contains a cloning site. 

13. The method of claim 8 wherein a foreign DNA fragment is contained in the msd region of the retron which codes 
the DNA of the msDNA. 

14. The method according to anyone of claims 1-13 wherein the retron is in a vector selected from the group consisting 
of YEp521-M1 (ATCC N* 74092), YEp521~M3, YEp521-M4 (ATCC N° 74093), and YEp521-M5 (ATCC N° 74094). 

15. The method of claim 4 wherein the msDNA molecule comprises a branched single-stranded RNA which is cova- 
lently linked to a single-stranded DNA by a 2',5'~phosphodiester bond between the 2-OH group and an internal 
rG residue and the 5'-phosphate of the DNA molecule, which RNA is non-covalently linked to the DNA by base- 
pairing between the complementary 3' ends of the RNA and DNA molecules, which RNA-DNA form stable stem- 
loop secondary structures, which msDNA is encoded by a primary RNA transcript, pre-msDNA which contains an 
open-reading frame (ORF) downstream of the msr locus, the ORF encoding a polypeptide which has sequence 
similarity with retroviral RTs and a highly conserved sequence common to all RTs. 

16. The method of claim 15 wherein the stable msDNA which is produced is selected from the group consisting of 
msDNA-Ec67, mSDNA-h4x162, msDNA-Mx65, msDNA-Ec107, msDNA-Mx86 and msDNA-Sal63, as shown in 
Figure 2 A. 
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17. The method of claim 4 wherein the ORF is located upstream of the msr locus. 

18. The method of claim 17 wherein the stable msDNA which is produced is msDNA-Ec67, as shown in Figure 2A. 

19. A DNA expression vector which replicates in an eucaryotic host which comprises a retron encoding a stable hybrid 
single-stranded RNA-DNA (msDNA) structure, which retron contains a gene encoding a reverse transcriptase (RT) 
and one coding region which contains two coding sequences, one ''msr" and the other "msd" for encoding, respec- 
tively, the RNA and DNA portions of the msDNA. 

20. The DNA expression vector of claim 19 wherein in the retron. the RT gene is downstream of the msr-msd region. 

21. The vector of claim 20 wherein the retron contains a single AUG cod on for the RT gene. 

22. The vector of claim 21 wherein in the retron, the RT gene is upstream of the msr-msd region. 

23. The vector of claim 19 which includes a promoter for the msr-msd region and the RT gene, which promoter is 
located upstream of msr. 

24. The vector of claim 23 wherein the promoter is a foreign strong promoter. 

25. The vector of claim 24 wherein the promoter is GAL 10. 

26. The vector of claim 22 in which a foreign DNA fragment is contained in the msd region of the retron which encodes 
the DNA of the msDNA. 

27. The vector of claim 26 wherein the fragment is 50-bp long. 

28. The vector of claim 20 which is YEp521-M1 (ATCC N° 74092), YEp521-M3, YEp521-M4 (ATCC N° 74093) or 
YEp521-M5 (ATCC N° 74094). 

29. A eucaryotic host cell transfected with a DNA expression vector which replicates in the cell, which vector contains 
a retron encoding a msDNA, which retron contains msr and msd coding regions (msr-msd) for the msDNA and a 
gene encoding a reverse transcriptase (RT). 

30. The eucaryotic host cell of claim 29 which is selected from the group consisting of yeast, plant cells and mammalian 
cells. 

31. The eucaryotic host cell of claim 30 which is yeast. 

32. The eucaryotic host cell of claim 31 wherein the yeast is of the genus Saccharomyces. 

33. The eucaryotic host cell of claim 30, 31 or 32 wherein in the retron, the RT gene is downstream of the msr -msd 
region. 

34. The eucaryotic host cell of claim 30 wherein the retron contains a single AUG codon for the RT gene. 

35. The eucaryotic host cell of claim 34 wherein in the retron, the RT gene is upstream of the msr - msd region. 

36. The eucaryotic host cell of claim 30 wherein the expression vector contains the promoter for the msr-msd region 
and RT gene, which promoter is located upstream of msr. 

37. The eucaryotic host cell of claim 36 wherein the promoter is a foreign strong promoter. 

38. The eucaryotic host cell of claim 37 wherein the strong promoter is GAL10. 

39. The eucaryotic host cell of claim 35 which contains a retron having a msd region which contains a foreign DNA 
fragment. 
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40. The eucaryotic host cell of claim 39 wherein the DNA sequence is 50-bp long. 



Patentansprtiche 

1 . Verfahren zur Herstellung einer bestandigen ms-DNA in einer eukaryotischen Wirtszelle, das die Transfektion der 
Zelle mit einem DNA-Expressionsvektor, der in derZelle repltztert, und ein die ms-DNA kodierendes Retron enthalt, 
das eine msr- und eine msd-Kodierungsregion (msr-msd) fur die ms-DNA sowie ein cine Reverse-Transcriptase 
(RT) kodierendes Gen enthalt, und die Expression der ms-DNA aus dem Retron umfaftt. 

2. Verfahren nach Anspruch 1, bei dem der Expressionsvektor ein Plasmid ist. 

3. Verfahren nach Anspruch 2, bei dem die eukaryotische Wirtszelle Hefe ist. 

4. Verfahren nach Anspruch 3, bei dem die Hefe der Gattung Saccharomyces angehort. 

5. Verfahren nach Anspruch 1 , bei dem die eukaryotische Wirtszelle eine Pflanzen- Oder Saugerzelle ist. 

6. Verfahren nach Anspruch 3, bet dem im Retron das RT-Gen unterhalb der msr-msd-Region angeordnet ist. 

7. Verfahren nach Anspruch 3, bei dem das Retron ein esnziges AUG-Kodon fur das RT-Gen enthalt. 

8- Verfahren nach Anspruch 7, bei dem im Retron das RT-Gen oberhalb der msr-msd-Region angeordnet ist. 

9. Verfahren nach Anspruch 3, bei dem der Expressionsvektor einen Promoter fur ein aus der msr-msd-Region und 
einem RT-Gen bestehendes Retron enthalt und der Promoter oberhalb der msr-Region angeordnet ist. 

10. Verfahren nach Anspruch 9, bei dem der Promoter ein fremder starker Promoter ist. 

11. Verfahren nach Anspruch 10, bei dem der starke Promoter GAL 10 ist. 

12. Verfahren nach Anspruch 2, bei dem im Retron im Plasmid die msd-Sequenz einen Klonungssite enthalt. 

1 3. Verfahren nach Anspruch 8, bei dem ein fremdes DNA-Fragment in der msd-Region des Retron enthalten ist, das 
die DNA der ms-DNA kodiert. 

14. Verfahren nach einem der Anspruche 1 bis 13, bei dem das Retron in einem Vektor enthalten ist, ausgewahlt aus 
der Gruppe, bestehend aus YEp521-M1 (ATCC Nr. 74092), YEp521-M3, YEp521-M4 (ATCC Nr. 74093) und 
YEp521-M5 (ATCC Nr. 74094). 

15. Verfahren nach Anspruch 4, bei dem dasms-DNA-Molekul eine verzweigte einstrangige RNA umfa&t, die mit einer 
einstrangigen DNA durch eine 2\5*-Phosphodiester-Bindung zwischen der 2'-OH~Gruppe von einem inneren rG- 
Rest und dem 5'-Phosphat des DNA-Molekuls kovafent verknupft ist und mit der DNA durch Basenpaarung zwi- 
schen den komplementaren 3'-Enden der RNA- und DNA-fVtolekiiie nicht-kovalent verknupft ist, die RNA-DNA 
bestandige sekundare Stem-Loop-Strukturen bilden, die ms-DNA durch ein primares RNA-Transcript, die pra ms- 
DNA, kodiert wird, die einen offenen Leserahmen (ORF) unterhalb des msr-Locus enthalt, der ein Polypeptid 
kodiert, das mit retroviralen RT's Sequenzahnlichkeitaufweistund eine mitsamtlichen RT'sgemeinsame, in hohem 
Mafie konservierte Sequenz umfafit. 

16. Verfahren nach Anspruch 15, bei dem die bestandige ms-DNA, die produziert wird, aus der Gruppe, bestehend 
aus ms~DNA-Ec67, ms-DNA-Mx162, ms-DNA-Mx65, ms-DNA-Ec107,ms-DNA~Mx86 und ms-DNA-Sa163 T wie in 
Fig. 2A gezeigt, ausgewahlt wird. 

17. Verfahren nach Anspruch 4, bei dem der ORF oberhalb des msr-Locus angeordnet ist. 

18. Verfahren nach Anspruch 17, bei dem die bestandige ms-DNA, die produziert wird, wie in Fig. 2A gezeigt, ms- 
DNA-Ec67 ist. 
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19. DNA-Expressionsvektor, der in einem eukaryotischen Wirt repliziert, der ein Retron umfa&t, das eine bestSndige 
einstrangige Hybrid-RNA-DNA (ms-DNA)-Struktur kodiert und ein Gen, das eine Reverse-Transcriptase (RT) ko- 
diert, sowie eine Kodierungsregion enthalt, die zwei Kodierungssequenzen umfaftt, und zwar eine msr-Sequenz 
fur die Kodierung des RNA-Anteils und eine msd-Sequenz fur die Kodierung des DNA-Anteils derms-DNA. 

20. DNA-Expressionsvektor nach Anspruch 19, bei dem im Retron das RT-Gen unterhalb der msr-msd-Region ange- 
ordnet ist 

21. Vektor nach Anspruch 20, bei dem das Retron ein einziges AUG-Kodon fur das RT-Gen enthalt. 

22. Vektor nach Anspruch 21, bei dem im Retron das RT-Gen oberhalb der msr-msd-Region angeordnet ist. 

23. Vektor nach Anspruch 19, der einen Promoter fur die msr-msd-Region und das RT-Gen umfaftt und der Promoter 
oberhalb der msr-Region angeordnet ist. 

24. Vektor nach Anspruch 23, bei dem der Promoter ein fremder starker Promoter ist 

25. Vektor nach Anspruch 24, bei dem der Promoter GAL 10 ist. 

26. Vektor nach Anspruch 22, bei dem ein fremdes DNA-Fragment in der msd-Region des Retron enthaiten ist, das 
die DNA der ms-DNA kodiert. 

27. Vektor nach Anspruch 26, bei dem das Fragment 50 bp lang ist. 

28. Vektor nach Anspruch 20, der YEp521-M1 {ATCC Nr. 74092), YEp521-M3 ( YEp521-M4 (ATCC Nr. 74093) Oder 
YEp521-M5 (ATCC Nr. 74094) darstellt 

29. Eukaryotische Wirtszelle, die mit einem DNA-Expressionsvektor transfiziert ist, der in der Zelle repliziert, wobei 
der Vektor ein eine ms-DNA kodierendes Retron enthalt, das eine msr- und eine msd-Kodierungsregion (msr-msd) 
fur die ms-DNA sowie ein eine Reverse-Transcrtptase (RT) kodierendes Gen enthalt. 

30. Eukaryotische Wirtszelle nach Anspruch 29, die aus der Gruppe, bestehend aus Hefe, Pflanzen- und Saugerzellen, 
ausgewckhlt wird. 

31. Eukaryotische Wirtszelle nach Anspruch 30, die Hefe darstellt. 

32. Eukaryotische Wirtszelle nach Anspruch 31 , bei der die Hefe der Gattung Saccharomyces angehort 

33. Eukaryotische Wirtszelle nach Anspruch 30, 31 Oder 32, bei der im Retron das RT-Gen unterhalb der msr-msd- 
Region angeordnet ist. 

34. Eukaryotische Wirtszelle nach Anspruch 30, bei der das Retron ein einziges AUG-Kodon fur das RT-Gen enthalt. 

35. Eukaryotische Wirtszelle nach Anspruch 34, bei der im Retron das RT-Gen oberhalb der msr-msd-Region ange- 
ordnet ist. 

36. Eukaryotische Wirtszelle nach Anspruch 30, bei der der Expressionsvektor den Promoter fur die msr-msd-Region 
und das RT-Gen enthalt und der Promoter oberhalb der msr-Region angeordnet ist. 

37. Eukaryotische Wirtszelle nach Anspruch 36, bei der der Promoter ein fremder starker Promoter ist. 

38. Eukararotische Wirtszelle nach Anspruch 37, bei der der starke Promoter GAL 10 ist. 

39. Eukaryotische Wirtszelle nach Anspruch 35, die ein Retron mit einer msd-Region enthalt, die ein fremdes DNA- 
Fragment enthalt. 

40. Eukaryotische Wirtszelle nach Anspruch 39, bei der die DNA-Sequenz 50 bp lang ist. 
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Revindications 

1. Precede de production d'un ADNms stable dans une cellule h6te eucaryote, qui cornprend la transfection de la 
cellule avec un vecteur d'expression d'ADN qui se replique dans la cellule, ce vecteur contenant un retron codant 
pour I'ADNms, lequel retron contient les regions cod antes msr et msd (msr-msd) pour I'ADNms et un gene codant 
pour une transcriptase inverse (RT), et ! 'expression de I'ADNms a partir du retron. 

2. Procede selon la revendication 1, dans lequel le vecteur d'expression est un plasmide. 

3. Procede selon la revendication 2, dans lequel la cellule hate eucaryote est une levure. 

4. Procede selon la revendication 3, dans lequel la levure est du genre Saccharomyces. 

5. Procede seion la revendication 1 , dans lequel la cellule h6te eucaryote est une cellule vegetale ou une cellule de 
mammifere. 

6. Procede selon la revendication 3, dans lequel, dans le retron, le gene de RT est en aval de la region msr-msd. 

7. Procede selon la revendication 3, dans lequel le retron contient un seul codon AUG pour le gene de RT 

8. Procede selon la revendication 7, dans lequel, dans le retron, le gene de RT est en amont de la region msr-msd, 

9. Procede seion la revendication 3, dans lequel le vecteur d'expression contient un promoteur pour un retron cons- 
titue d*une region msr-msd et d'un gene de RT, ce promoteur etant situe en amont de msr. 

10. Procede selon la revendication 9, dans lequel le promoteur est un promoteur fort etranger. 

11. Procede selon la revendication 10, dans lequel le promoteur fort es\GAL10. 

12. Procede selon la revendication 2, dans lequel, dans le retron contenu dans le plasmide, la sequence msd contient 
un site de clonage. 

13. Procede selon la revendication 8, dans lequel un fragment d'ADN etranger est contenu dans la region msd du 
retron qui code pour I'ADN de I'ADNms. 

14. Procede selon I'une quelconque des revendications 1 a 13, dans lequel le retron est contenu dans un vecteur 
choisi dans le groupe constitue par YEp521-M1 (ATCC n° 74092), YEp521-M3, YEp521-M4 (ATCC n Q 74093) et 
YEp521-M5 (ATCC n° 74094). 

15. Procede selon la revendication 4, dans lequel la molecule d'ADNms cornprend un ARN simple brin ramifie, lie par 
covalence a un ADN simple brin par une liaison phosphodiester 2' ,5' entre le groupe 2'-OH d'un residu rG interne 
et le phosphate en 5' de la molecule d'ADN, cet ARN est lie de maniere non covalente a I'ADN par un appariement 
de bases entre les extremites complementaires 3' des molecules d'ARN et d'ADN, ce systeme ARN-ADN forme 
des structures secondaires de type tige-boucle stables, cet ADNms est code par un transenpt ARN primaire, un 
pre-ADNms qui contient un cadre de lecture ouvert (ORF) en aval du locus msr, I'ORF codant pour un polypeptide 
qui a une analogie de sequence avec des RT retrovirales et une sequence hautement conservee commune a 
toutes les RT. 

16. Procede selon la revendication 15, dans lequel I'ADNms stable qui est produtt est choisi dans ie groupe constitue 
par l'ADNms-Ec67, rADNms-Mx162, I'ADN ms-Mx65, !'ADNms-Ec107, l'ADNms-Mx86 et !'ADNms-Sal63, telsque 
presentes dans la figure 2A. 

17. Procede selon la revendication 4, dans lequel I'ORF est situe en amont du locus msr. 

18. Procede selon la revendication 17, dans lequel I'ADNms stable qui est produit est l'ADNms-Ec67, te! que presente 
dans la figure 2A. 

19. Vecteur d'expression d'ADN qui se replique dans un note eucaryote et qui cornprend un retron codant pour une 
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structure ARN-ADN simple brin hybride stable (ADNms), ce retron contenant un gene codant pour une transcrip- 
tase inverse (RT) et une region cod ante contenant deux sequences eodantes, I'une msr et I'autre msd, codant 
respectivement pour les portions d'ARN et d'ADN de I' ADNms. 

20. Vecteur d'expression d'ADN selon la revendication 19, dans lequel dans !e retron, le gene de RT est en aval de 
la region msr-msd. 

21. Vecteur selon la revendication 20, dans lequel le retron contient un seul codon AUG pour le gene de RT. 

22. Vecteur selon la revendication 21, dans lequel, dans le retron, le gene de RT est en amont de la region msr-msd, 

23. Vecteur selon la revendication 19, qui comprend un promoteur pour la region msr-msd et le gene de RT, ce pro- 
moteur etant situe en amont de msr. 

24. Vecteur seion la revendication 23, dans lequel le promoteur est un promoteur fort etranger. 

25. Vecteur selon la revendication 24, dans lequel le promoteur est GAL 10. 

26. Vecteur selon la revendication 22, dans lequel un fragment d'ADN etranger est contenu dans la region msd du 
retron qui code pour PADN de I'ADNms. 

27. Vecteur selon la revendication 26, dans lequel le fragment a une longueur de 50 bp. 

28. Vecteur selon la revendication 20, qui est YEp521-M1 (ATCC n° 74092), YEp521-M3, YEp521-M4 (ATCC n° 
74093) ou YEp521~M5 (ATCC n° 74094). 

29. Cellule note eucaryote transferee avec un vecteur d'expression d'ADN qui se replique dans la cellule, ce vecteur 
contenant un retron codant pour un ADNms, ce retron contenant des regions eodantes msr et msd (msr-msd) pour 
I'ADNms et un gene codant pour une transcriptase inverse (RT). 

30. Cellule h6te eucaryote selon la revendication 29, qui est choisie dans le groupe constitue par des ievures, des 
cellules vegetales et des cellules de mammifere. 

31. Cellule hdte eucaryote selon la revendication 30, qui est une levure. 

32. Cellule h6te eucaryote selon la revendication 31 , dans laquelle la levure est du genre Saccharomyces. 

33. Cellule note eucaryote selon la revendication 30, 31 ou 32, dans laquelle, dans le retron, le gene de RT est en 
aval de la region msr-msd. 

34. Cellule note eucaryote selon la revendication 30, dans laquelle le retron contient un seul codon AUG pour le gene 
de RT 

35. Cellule note eucaryote selon la revendication 34, dans laquelle, dans, le retron, le gene de RT est en amont de la 
region msr-msd. 

36. Cellule hdte eucaryote selon la revendication 30, dans laquelle le vecteur d'expression contient le promoteur pour 
la region msr-msd et le gene de RT, ce promoteur etant situe en amont de msr. 

37. Cellule note eucaryote selon la revendication 36, dans laquelle le promoteur est un promoteur fort etranger. 

38. Cellule note eucaryote selon la revendication 37, dans laquelle le promoteur fort est GAL10. 

39. Cellule hote eucaryote selon la revendication 35, qui contient un retron ayant une regionmsd contenant un fragment 
d'ADN etranger. 

40. Cellule hote eucaryote selon la revendication 39, dans laquelle la sequence d'ADN a une longueur de 50 bp. 
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